About

Overview

The cran-search project aims to provide a database to perform a package search for the R programming language in the Comprehensive R Archive Network (CRAN) repository. The collected data are obtained by the tools::CRAN_package_db() function and selected only a few columns to perform the search for the topic of interest.

In the following table, it is possible to verify a brief structure of the data frame collected with packages available in CRAN. For example, the number of rows and columns, and the frequency of words longer than 3 or 4 characters for the column named title, description, and license. A depth investigation of the data is at the discretion of the reader.

update structure information
2023-01-14 column update, package, version, license, title, description, date, depends, import, url
2023-01-14 n_column 10
2023-01-14 n_row 19057
2023-01-14 NA TRUE
2023-01-14 title frequency: (1) data 3088 (48.06%), (2) analysis 1950 (30.35%), (3) with 1387 (21.59%)
2023-01-14 description frequency: (1) data 12170 (42.31%), (2) with 8374 (29.11%), (3) package 8221 (28.58%)
2023-01-14 license frequency: (1) license 4795 (49.84%), (2) file 4406 (45.80%), (3) lgpl 420 (4.37%)

Author

Author
name url
author Bruno Faria
website https://brunofariadf.github.io/
github https://github.com/brunofariadf/
Project
name url
main cran-search
review News
license MIT